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COMPOSITIONS AND METHODS FOR RECOMBINANT PRODUCTION OF CROTALIDUS VENOM 

FIBROLASE 



■ Technical Field 

This Invention is In the fields of enzyme chemistry, genetic engineering, and thrombolytic therapy. More 
5 particularty, it relates to the recombinant production of the fibrinolytic enzymes, present In the venom of 
snakes of the family Crotalidae, such as Agklstrodon contortrix contortrix (southern copperhead) fibrolase. 



Background Art 

Thrombolytic therapy is an established procedure for treatment of various thromboembolic conditions 
such as pulmonary embolism, thrombophlebitis, and arterial thromboembolism. Many of the thrombolytic 
agents being Investigated act Indirectly by activating plasminogen throughout the circulation and are, 
therefore, not directed speclfteaily towards the thrombus. These plasmiriogen-activating fibrinolytic agents 

/5 have perceived disadvantages in therapy, such as Inducing sufficient plasmin to deplete clotting factors to 
levels that enhance the probability of hemonrhaglc complications after thrombolytte therapy. These per- 
ceived problems have led to the investigation of direct-acting thrombolytic agents, such as the fibrinolytic 
enzymes present Inr many snake venoms. 

Numerous direct acting fibrinolytic enzymes have been identified in snake venoms. Among those 

20 isolated from Crotalidus venom are southern copperhead fibrolase, tfie purification and characterization of 
which is described in U.S. patent No. 4.610,879 and green pit viper fibrolase described In EPA publication 
no. 0020780. Preliminary investigation of soutiiem copperhead fibrolase indicates it offers promise as a 
safe, effective, direct-acting agent for thrombolysis. Despite the tiierapeutic promise of tills agent, however, 
it will be impractical to canv out extended clinical Investigation of native fibrolase because of the scarcity of 

25 the source venom.' Also fibrolase purified from venom pooled from a number of snakes Is likely to be 
heterogenous due to allelic'polymorphic variation between Individual snakes. A principal purpose of the 
present Invention is to provide a more- practical means for obtaining homogenous Crotaildus fibrolases tiian 
purifying them from venom. 

30 

Disclosure of tiie Invention 

Accordingly, one aspect of tiie invention is recombinant DNA encoding a Crotaildus fibrolase. such as 
soutiiem copperiiead fibrolase. 
35 Cloning and expression vectors containing such recombinant DNA are another aspect of the invention. 

Hosts such as transformed bacteria, yeast and mammalian cells which contain such expression vectors 
and are capable of producing active recombinant Crotaildus fibrolase are another aspect of the invention. 

Mettiods for producing Crotaildus fibrolase which employ such hosts are still another aspect of the 
invention. 

40 Recombinant homogeneous Crotaildus fibrolase whteh is free of ottier Crotaildus proteins Is a further 

aspect of the invention. ^ 



Brief Description of the Drawings 

45 

Figure 1 shows the nucleotide sequence and the deduced amino acid sequence of a southern 
copperhead fibrolase clone designated SVQ-48 (or FIB 48). 

Figure 2 shows the partial nucleotide sequence and the deduced partial amino acid sequence of 
another southern copperhead fibrolase clone designated SVQ-51 (or FIB 51). 
50 Figure 3 shows a comparison of the nucleotide sequences of the coding strands of ttie fibrolases of 

Rgures 1 and 2. Nucleotide variances between the sequences are asterisked. . 

Rgure 4 shows a comparison of the amino acid sequences (using • single letter amino acid 
designations) of the fibrolases of Figures 1 and 2. Amino acid variances between the two sequences are 
asterisked. 
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Rgure 5 Is a flow diagram depicting a scheme for preparing the yeast expression plasmids 
pAB24F248 or pAB24F251 which can be used to produce southern copperhead fibrolase in yeast. 

Rgure 6 shows the nucleotide sequence of the 1341 bp ADH2-QAPDH promoter fragment refen-ed to 

in the Examples. 

5 Rgure 7 shows the nucleotide sequence and amino acid sequence of an o-factor profibrolase fusion 

construct with a Lys-Arg processing site at the pro-mature junction (Example 5, Infra). 

Rgures 8. 9, 10 and 11 are the nucleotide and corresponding amino acid sequences of the a-factor 
profibrolase fusion constructs of plasmids pKS308, pKS311, pKS314 and pKS317 of Example 5, Infra. 

10 

Modes for Carrying Out the Invention 
1. Definitions 

75 

The term "recombinant" as used herein to characterize DNA encoding Crotalidus fibrolase Intends ONA 
of genomic. cDNA, semisynthetic, or synthetic origin which, by virtue of its origin or manipulation is (l) not 
associated with all or a portion of the DNA with which it is associated In nature or In the form of a library 
and/or (2) linked to DNA other than that to which it is linked in nature. "Recombinant" as used to describe 
20 Crotalidus fibrolase Intends protein produced from such DNA. 

A "replicon" Is any genetic element (e.g., a plasmid, a chromosome, a virus) that behaves as an 
autonomous unit of polynucleotide replication within a cell; i.e.. capable of replication under its own control. 

A "vector" Is a replicon In which another polynucleotide segment Is attached, so as to bring about the 
replication and/or expression of the attached segment. An "expression vector" refers to a vector capable of 
25 autonomous replication or Integration and contains control sequences which direct the transcription and 
translation of the southern copperhead fibrolase DNA in an appropriate host 

A "coding sequence" is a polynucleotide sequence which Is transcribed and/or translated Into a 
polypeptide. 

A "promoter sequence" is a DNA regulatory region capable of binding RNA polymerase and Initiating 
30 transcription of a downstream (i.e., In the 3' direction) coding sequence. 

A coding sequence Is ."under the control" of the promoter sequence In a cell when transcription of the 

coding sequence results from the binding of RNA polymerase to the promoter sequence; translation of the 

resulting mRNA then results in the polypeptide encoded within the coding sequence. 

"Operably linked" refers to a juxtaposition wherein the components are configured so as to perform 
35 their usual function. Thus, control sequences operably linked to a coding sequence are capable of effecting 

the expression of the coding sequence. 

"Control sequences" refers to those sequences which control the transcription and/or translation of the 

coding sequence(s); these may include, but are not limited to, promoter sequences, transcriptional Initiation 

and termination sequences, and translational Initiation and tennination sequences. In addition, "control 
40 sequences" refers to sequences whtoh control the processing of the polypeptide encoded witiiln the coding 

sequence; these may include, but are not limited to sequences controlling secretion, protease cleavage, and 

glycosylation of the polypeptide. 

"Transformation" Is ttie Introduction of an exogenous polynucleotide into a host cell. The exogenous 

polynucleotide may be maintained as a plasmid, or alternatively, may be integrated within the host genome. 
45 "Crotalldae" denotes tiie family of snakes commonly known as pit vipers. Members of the Crotalldae 

family Include species of tfie subfamilies AgWstrodon (A). Crotalus (C). Bothrops (B) and Trimeresurus (T). 

Examples of snake species of this family are A. acutus , A^ billneatus . A^ callginosus , A. contortrix . A. halys , 

A. halys blomhoffii. A. hypnale . A. mokasen , A. piscivorous , A. rhodostoma , A. saxatills . B. alternatus . B. 

SoxTb! billneatus , B. caribbaeus . B. godmanl . B. itapetinlngae . B. jararaca . B. jararacussu , B. lanceolatus . 
50 Briahsbergii. B. nasute , B. neuwiedi . B. nlgroviridlFmarchi . B. nummifer (B. nummlfera ). B. schlegelii , C. 

adamanteus. 67 atrox . C7 "bastiiscus . C. cerastes . C. confluentus . C, durissus . C. durissus terriflcus . C. 

horridus. C. Ieplduirc." 7nltchellll, C. molossus , C. ruber. C. scutulatus . C. tigrls, C. unicolor . C. viridls 

lutosus, cTviridis oreganus. L. mutiJs . S. catenatus . S. miliarius , S. ravus . 1\ albolabris . 'L elegans . T. 

fiavovlridisi T. gramineus . T7 monticola7 T. mucrosquamatus , T\ oklnavensls . T^ popeoTum . T pur; 
55 pureomaculafajs. T. stejnegeriTT. tokarensis . T. wagleri . 



2. Recombinant Southern Copperhead DNA 
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Figures 1-4 show the nucleotide sequences and deduced amino acid sequences of two southern 
copperhead fibrolase clones, designated pSVQ-48 (or Rb 48) and pSVG-51 (or Rb 51). Fib 48 includes the 
complete sequence for southern copperhead preproflbrolase whereas Rb 51 lacks a short 5 sequence and 
initiator codon. 

5 Based on the Fib 48 and Rb 51 sequences and amino acid sequences it is apparent that there are 
polymorphic variations and/or allelic variations of southern copperhead fibroiase. Amino acid sequencing of 
native protein shows different residues at amino acids 210 (Tyr — Asn) and 360 (ThrA/ai — Met) (Figure 4) 
from the sequences predicted from the two clones thus providing further evidence of such variation. 

Further amino acid sequencing of native protein provided an amino acid sequence for the mature 

10 fibroiase protein differing by two amino acids from the sequence predicted from Fib 51 (Rgures 2 and 4). 
The six differences are (numbering begins at amino acid 185 of Rgure 2): 123 (Pro — Ser) and 170 (Vai — 
Met). 

This sequence Is sometimes refenred to herein as the "corrected native sequence". 

It is further expecte?! that fibrolases from otfier Crotalldus species may also vary intraspecies as well as 

15 varying from species to species but wlii exhibit significant homology (i.e., 30% or more identity, more 
usually 50% or more Identity, in amino acid sequence to the southern copperhead fibroiase sequences 
described herein). Rbrolase DNA of Crotalidae of species other than southern copperiiead fibroiase may be 
identified and Isolated as described below. Amino acid sequences may be deduced from that DNA. 

The recombinant southern coppertiead DNA of the invention encodes at least amino acids 192 to 393 of 

20 the Fib 48 sequence of Figure 4. or the corresponding amino acids of the Fib 51 sequence of Rgure 4 
(amino acids 186-387 of the Rb 51 sequence), or the corrected native amino acid sequence shown in 
Figure 7. and analogs of those amino acid sequences which are substantially homologous and functionally 
equivalent thereto. Based on the sequences of clones pSVQ-48 and pSVG-51 shown in Rgures 1 and 2 and 
amino acid sequencing of native southern copperhead fibroiase the stmcture of southern copperhead 

25 fibroiase Is believed to be that of a prepropolypeptide with the first 190-191 amino acids of the Rb 48- 
sequence being a leader sequence and the mature protein beginning at the Gin at position 190 or the Gin 
at position 191 (Rgure 1) or at 185/186 in the Fib 51 sequence (Rgure 2). it is further believed that the 
carfaoxy terminus of the prepropolypeptide is processed to remove the final 18 amino acids. Mass 
spectrophotometric analysis of peptides derived from mature native southern copperhead fibroiase indicates 

30 the amino terminus is a cyciized glutamine residue with some molecules starting at the Gin at 185 (i.e., 
pGIu-QIn-Arg/Phe-...) and other molecules starting at the Gin at 186 (i.e.. pQlu-Arg/Phe-.,.) of Figure 2. 
Reaction of mature native fibroiase witii DTNB indicated there are no free sulfhydryl groups In southern 
copperhead fibroiase meaning there are three disulfide bonds in the molecule. The term "substantially 
homologous" as used herein intends to Include such variations polymorphic and/or allelic variations as well 

35 as other variations that do not destroy the fibroiase activity of the molecule, in general, the homology in 
amino acid sequence will be at least about 70%, more usually at least about 75%. The term "functionally 
equivalent" Intends that the sequence of the analog defines a protein having the biological activity of 
fibroiase (as measured by the azocasein or fiuorometric assays described in the examples). The sequence 
may include a portion or ail of the leader sequence (amino acids 1-190/191 and all or a portion of the 

40 processed cartDoxy terminus of the prepropolypeptide (amino acids 393-411 of Fib 48 or the corresponding 
amino acids of Fib 51). 

The recombinant fibroiase DNA may be genomic, cDNA or synthetic DNA, By way of example, the 
sequences shown in Figures 1 and 2 were obtained from a cDNA library prepared from mRNA obtained 
from southern copperhead venom glands. The library was screened with cDNA probes based on abundant 

45^ mRNA sequences and clones showing strong hybridization were selected. Identified fibroiase clones may in 
tum be used to screen genomic or cDNA southem copperhead libraries to obtain allelic or polymorphic 
variants of the fibroiase genes shown in the drawings or to screen libraries of other Crotalidus species to 
identify the fibroiase genes contained therein. A DNA sequence encoding the "native" protein sequence 
may be made by site specific mutagenis of clone 51 cDNA or by other conventional methods. The general 

50 procedures used to prepare probes based on the Identified clones, to prepare other libraries, and to screen 
those libraries for Crotalidus fibroiase sequences are known in the art and do not require elaboration. Based 
on the amino acid sequences deduced from the illustrated DNA sequences, synthetic genes encoding 
fibroiase may be prepared in vitro by synthesizing individual overiapping complementary oligonucleotides 
and filling in single stranded nonoverlapplng portions using DNA polymerase in the . presence of the 

55 deoxyribonucieotide triphosphates. For expression In a particular organism. It may be desirable to use a 
synthetic DNA sequence that employs codons preferred by tiie particular host In which the DNA is 
expressed. Mutations of the genes may be made by site specific mutagenesis. Such mutations may be 
used to. make fibroiase anabgs. 
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3. Cloning of Rbrolase DNA 

The fibrolase DNA can be cloned Into any suitable repllcon to create a vector, and thereby be 
maintained in a composition which Is substantially free of vectors that do not contain the fibrolase gene 

5 (e.g., other clones derived from the library). Numerous cloning vectors are known to those of skill In the art, 
and the selection of an appropriate cloning vector Is a matter of choice. Examples of vectors for cloning and 
host cells which they can transform include the bacteriophage x (E coll). pBR322 (E. coll). pACYC 177 (E. 
coil). pKT230. (gram-negative bacteria), pQV1l06 (gram-negative bacteria), pLAFRl (gram-negative bac- 
teiTa), pME290 (non-E. coli gram-negative bacteria), pHV 14 (E. coll and Bacillus subtills) . pBD9 ( Bacillus) . 

10 plJ61 (Streptomyces ). pUC6 (Streptomyces) . actinophage 4>C31 ( Streptomyces) . Ylp5 (Saccharomyces). 
YCp19 (Saccharomyces) , YEp24 and YEp13 ( Saccharomyces ). and bovine papilloma virus (mammalian 
cells). 

75 4. Expression of Fibrolase DNA 

The polynucleotide sequence encoding the fibrolase polypeptide is expressed by inserting the se- 
quence Into an appropriate repllcon thereby creating an expression vector, and introducing the resulting 
expression vector into a compatible host. 

20 In creating an expression vector the sequence encoding the fibrolase polypeptide is located In the 
vector with the appropriate control sequences. The positioning and orientation of the coding sequence with 
respect to the control sequences Is such that the coding sequence is transcribed under the control of the 
control sequences: I.e., the promoter will control the transcription of the mRNA derived from the coding 
sequence; and the ribosomes will bind at the ribosomal binding site to begin the translational process; and 

25 the stop codon used to tennr>lnate translation will be upstream from the transcriptional temiination codon. 
Commonly used prokaryotic control sequences include such commonly used promoters as the ^-lactamase 
(penicillinase) and lactose (iac) promoter systems (Chang et al. Nature (1977) 198:1056) and the tryptophan 
(trp) promoter system (Goeddel et al. Nucleic Acids Res (1980) 8:4057) and the lambda-derived Pl 
promoter and N-gene ribosome binding site (Shimatake et al, Nature (1981) 292:128). Control sequences for 

30 yeast vectors include promoters for the synthesis of glycolytic enzymes (Hess et al, J Adv Enzyme Reg 
(1968) 7:149; Holland et al. Biochemistry (1978) 17:4900). Additional promoters known in the art include the 
promoter for S-phosphogiycerate kinase (HItzeman et al. J Biol Chem (1980) 255:2073). Other promoters, 
which have the additional advantage of transcription controlled by growth conditions and/or genetic 
background are the promoter regions for alcohol dehydrogenase 2 (ADH2). isocytochrome C. acid 

35 phosphatase, degradatlve enzymes associated with nitrogen metabolism, the alpha factor system and 
enzymes respon^ble for maltose and galactose utilization. It Is also believed, transcription temilnator 
sequences are desirable at the 3 end of the coding sequences. Such terminators are found In the 3 
untranslated region following the coding sequences in yeast-derived genes. Expression vectors for mamma- 
lian cells such as VERO. Hela or CHO cells, ordinarily Include promoters and Control sequences compatible 

40 with such cells as. for example, the commonly used early and late promoters from Simian Virus 40 (SV40) 
(Rers et al. Nature (1978) 273:113). or other viral promoters such as those derived from polyoma, 
Adenovirus 2, bovine papilloma virus, or avian sarcoma viruses. The controllable promoter. hMTII (Karin, M.. 
et al, Nature (1982) 299:797-802)* may also be used. 

In addition to control sequences, It may be desirable to add regulatory sequences which allow for 

45 regulation of the expression of the fibrolase gene relative to the growth of the host cell. Examples of 
regulatory systems are those which cause the expression of a gene to be turned on or off in response to a 
chemical or physical stimulus. Including the presence of a regulatory compound. In prokaryotic systems 
these would include the lac and trp operator systems. In eukaryotic systems Induction can occur In 
methallothionein genes with heavy metals and the Mouse Mammary Tumor Virus (MMTV) system with 

50 steroids. In these cases, the sequence encoding the fibrolase polypeptide would be placed In tandem with 
the regulatory element 

There are also selective elements which give rise to DNA amplification which in turn can result in higher 
levels of specific protein production. In eukaryotic systems these Include the dihydrofolate reductase gene 
(dhfr) which Is amplified In the presence of methotrexate, and adenosine deaminase (ADA) in the presence 
55 of deoxycorfomycln. In these cases the sequence encoding the fibrolase polypeptide may either be present 
on the same plasmid or merely be cotransf^cted together with the selectable element to allow for 
Integration within the host cell genome near each other. 

Other types of regulatory elements may also be present In the vector. I.e., those which are not 
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necessarily In tandem with the sequence encoding fibrolase. An example is the SV40 enhancer sequence, 
which, by its mere presence, causes an enhancement of expression of genes distal to it. 

Modification of the sequence encoding fibrolase, prior to its insertion into the replicon, may be desirable 
or necessary, depending upon the expression system chosen. For example. In some cases, It may be 

5 necessary to modify the sequence so that It may be attached to the control sequences with the appropriate 
orientation, i.e., to maintain the reading frame. In some cases, It may be desirable to add sequences which 
cause the secretion of the polypeptide from the host organism, with subsequent cleavage of the secretory 
signal. The techniques for modifying nucleotide sequences utilizing cloning are well known to those skilled 
in the art. They Include, e.g.. the use of restriction enzymes, of enzymes such as Bal31 to remove excess 

10 nucleotides, and of chemically synthesized oligonucleotides for use as adapters, to replace lost nucleotides, 
and in site directed mutagenesis. 

The appropriately modified sequence encoding the fibrolase polypeptide may be ligated to the control 
sequences prior to Insertion into a vector. Alternatively, the coding sequence can be cloned directly into an 
expression vector which already contains the control sequences and an appropriate restriction site. For 

15 expression of the fibrolase polypeptide in prokaryotes and in yeast, the control sequences will necessarily 
be heterologous to the coding sequence. In cases where the fibrolase gene is to be expressed in cell lines 
. derived from vertebrates, the control sequences may be either heterologous or homologous, depending 
upon the particular cell line. 

In experimentation with various expression vectors-integrated, nonintegrated,* intracellular (intemal), 

20 extracellular (secretory)-carried out to date, yeast expression vectors In which the S. cerivlslae a-factor 
leader sequence directs secretory expression of a profibrotase gene encoding the corrected native fibrolase 
sequence and having an Lys Arg (KR) processing site at the pro-mature junction of the profibrolase coding 
region have given the highest yields of active mature fibrolase. Constructs using both nonregulatable 
promoters (e.g., QAPDH) and regulatable promoters- (e.g. ADH2/GAPDH) were made. Use of a regulatable 

25 promoter may be desirable to overcome possible toxic effects from large amounts of fibrolase during cell 
growth. A summary of these constructs is provided in the Examples, infra . 

Depending on the host cell used, transformation is done using standard techniques appropriate to such 
cells. The calcium treatment employing calcium chloride, as described by Cohen, S.N., Proc Natl Acad Sci 

• (USA) (1972) 69:2110. or the RbCl2 method described In Maniatis et al, Molecular Cloning: A Laboratory 

30 Manual (1982) Cold Spring Hartxr Press, p. 254 and Hanahan, D., J Mol Biol (1983) 166: 557-580 may be 
used for prokaryotes or other cells which contain substantial cell wall barriers. For mammalian celts without 
such cell walls, the calcium phosphate precipitation method of Graham and van der Eb. Virology (1978) 
52:546, optionally as modified by Wigler, M.. et al, Cell (1979) 16:777-785 may be used. Transformations 
into yeast may be carried out according to the method of Beggs, J.D.. Nature (1978) 275:104-109 or of 

35 HInnen. A., et al. Proc Natl Acad Sci (USA) (1978) 75:1929 or Ito et al, J Bacteriology 153:163. 

Transformed cells are then grown under conditions whk;h permit expression of the fibrolase gene and. if 
appropriate, processing into the mature protein. Because the gene is expressed in heterologous 
organisms/cells (i.e.. not snake), the protein Is free of the other snake proteins with which It is associated In 
southern copperhead venom. This recombinant fibrolase is also homogeneous and lacks allelic/polymorphic 

40 variations typically found in protein Isolated from pooled snake venom. The thus synthesized recombinant 
fibrolase is then isolated from the host ceils and purified. If the expression system secretes the fibrolase 
into the growth media, the fibrolase is isolated directly from the media. If the recombinant fibrolase is not 
secreted, it is isolated from cell lysates. The selection of the appropriate growth conditions ancJ recovery 
methods are within the skill of the art. With regard to purification, see for instance. U.S. Patent No. 

45 4.610.879. 



5. Use and Administration of Recombinant Rbrolase 

50 Recombinant fibrolase prepared according to the invention may be formulated and administered to 
vertebrates, particulariy mammals including humans, for treatment of thromboembolic conditions in the 
same manner as fibrolase Isolated from venom (see U.S. Pat. No. 4.610.879 and EPA Publication No. 
0020780 In this regard). The fibrolase will typically be formulated with a pharmaceutically acceptable 
injectable carrier such as physiological saline. Ringer's solution and the like for injection into circulation. 

55 Mildly hydrophllic polymeric substances such as albumins, polyvlnylaJcohol. polyethylene glycol, and the 
like may be added to stabilize the fibrolase. The concentration of fibrolase in the injectable will normally be 
in the range of 0.01 to 1 mg/ml. The patient will nomnaity be infused with a dose of atx)ut 0.1 to 80 mg 
fibrolase for adult humans. 
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The following examples further describe the Isolation of DNA encoding fibrolase and the expression of 
5 that DNA in yeast to produce recombinant fibrolase. 

In the following, "digestion" refers to the enzymatic cleavage of DNA by restriction endonucleases. 
Restriction endonucleases commonly referred to as restriction enzymes are well characterized and com- 
mercially available and used In accordance with manufacturer's specifications. Digestion with restriction 
enzymes is frequently followed by treatment with alkaline phosphatase according to manufacturer's 
TO specifications to remove the temiinai 5 phosphates, thus preventing self ligation of a vector having two 
compatible ends. 

"Fill in" refers to the enzymatic process of creating blunt ends by repairing overhanging ends 
generated by certain restriction enzymes. The repair Is a DNA polymerase I large fragment (Klenow) and 
deoxynucleotide triphosphates and is used according to manufacturer's specifications. 
15 Gel Isolation of a DNA restriction fragment refers to the recovery of a specific fragment, elec- 
trophoreticaily separated on either an agarose gel or polyacryiamide gel (depending on size of fragment), 
by either electroelution or melting and extraction of gel slice. 

All DNA manipulations are done according to standard procedures. See Maniatis et al. Molecular 
Cloning, Cold Spring Harbor Lab., 1982. All enzymes used are obtained from commercial sources and used 
20 according to the manufacturer's specifications. 



1. RNA Isolation from Southern Copperhead Venom Glands 

25 The method used to isolate RNA from snake venom glands utilizes guanldinium thiocyanate as a 
chaotroplc agent during tissue homogenization, direct precipitation of RNA from the guanldinium solution by 
LICI followed by purification of the RNA from residual DNA and protein by successive urea-LiCi washes. 

Southern copperiiead snakes (Agkistrodon c. contortrix) were obtained from Zooherp. Inc. Three snakes 
were killed with a lethal dose of Nembutoi and fie six venom glands were dissected free of the surrounding 

30 muscle. They were frozen in liquid nitrogen and the pulverized tissue was homogenized in 8 ml of lysis 
buffer containing 5 M guanldinium thiocyanate. .100 mM Tris«Hci (pH 7.5), 10 mM EDTA (pH 8.0) and 14% 
(v/v) beta-mercaptoethanol using a Tekmar Tissumizer for one min at room temperature. After dissolution, 
the homogenate was clarified by centrifugation (10.000 X g, 4*C. 10 min). The RNA was precipitated (16-24 
hr, 4*C) by adding .5.5 volumes of 4 M LiCI. The RNA and some protein and DNA was then pelleted by 

05 centrifugation (10,000 X g, 4*C. 30 min). The pellet was mechanically resuspended in 5 mi of a solution of 
3 M LICI containing 4 M urea by using a pasteur pipet sealed at the end. and then vortexed vigorously for 
about one min. The volume of the suspension was adjusted to 30 ml and the precipitate collected by 
centrifugation as described above. This procedure was repeated 2 times. The RNA pellet was finally 
dissolved In 2.5 ml of 1% sodium dodecyl sulfate (SDS). One volume of phenol (equilibrated with 50 mM 

40 Tris^Hcl. pH 8.0. 1 mM EDTA and 0.1% beta-mercaptoethanol) was added and the mixture vortexed 
vigorously for about one min. An equivalent volume of chloroform :isoamyl alcohol (24:1) was added and the 
solution vortexed one min. The mixture was centrifuged (4,000 X g, 4* C, 20 min); the aqueous phase was 
extracted with chloroform:lsoamyl alcohol and centrifuged (4.000 X g, 4* C, 10 min). The RNA was 
precipitated (16-24 hr. -20 *C) from the aqueous phase by adding 1/10 volume of 2 M potassium acetate 

45 (pH 5.0) and 3 volumes of absolute ethanol. The precipitated RNA .was collected by centrifugation (10,000 X 
g, 4' C. 30 min) and dissolved In 0.5 ml of dlethylpyrocarbonate-(DEP)-treated water and poiy(A) + RNA 
was prepared as described by Maniatis et ai. The concentration of the poly(A) + RNA was calculated based 
on an extinction coefficient at 260 nm of 25 per mg of RNA. The RNA was precipitated as described above 
and stored at -20* C. 

50 ■ 

2. Construction of cDNA Library from Southern Copperhead Venom Gland RNA 

Double-stranded cDNA was prepared from Southern copperhead venom gland poly(A) + RNA essentially as 
55 described by Qubler, U. and Hoffman, B.J.. Gene (1983) 25:263-269. After methylation of the internal EcoRI 
sites and the addition of Eco RI linkers, the cDNA was llgated into the Eco RI site of XgtlO. These steps are 
described in detail below: 
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A. For first strand synthesis. 3 ug of poly(A)+ RNA in 6 ul of DEP-treated water was used in a total 
volume of 20 ul. The final mixture contained 2 ul of 0.1 M dithiothreitol (OTT). 1 ul of RNasIn (Promega 
Blotec. diluted to 10 unlt&'ul In 10 mM DDT). 2 ul of 0.1 M MgCb. 1 ul of 1M Tri8»Hcl (pH a3). 2 ul of 1 
mg ml oligo(dT)12-18 (P-L Blochemicals). 2 ul of 10 mM each d(ACQT)Tps (P-L Blochemicals). 1 ul of pp] 

5 dCTP (Amersham. 3000 a/mmole). 1 ul of 80 mM sodium pyrophosphate and 2 ul of reverse transcriptase 
(Molecular Genetics Resources). The reactants were combined on ice, Incubated for 2 min at room 
temperature and then for 60 min at 45* C. The reaction was stopped by adding 2 ul of 0.5 M EDTA (pH 
7.0) and then phenol/chloroform/isoamyl alcohol (25/24/1) extracted. The organic phase was extracted with 
an equal volume of TE (10 mM Tris«Hcl, pH 8.0, 1 mM EDTA), the aqueous phases were combined and the 

10 first strand cDNA was precipitated by adding 16 ui of 7.5 M ammonium* acetate and 120 ul of absolute 
ethanol. The pellet of precipitated nucleic acid was collected by microcentrifugatlon, dried In a Savant 
Speed-Vac Concentrator, dissolved In 50 ul of HaO and precipitated by adding 20 ul of 7.5 M ammonium 
acetate and 150 ul of absolute ethanol. This procedure was repeated a third-lime, the pellet was rinsed In 
absolute ethanol and dissolved In 71 ul of DEP-treated water. 

15 B. For second strand cDNA synthesis, 71 ul of first-strand cDNA was used In a total volume of 100 

ul that Included 2 ul of 1 M Trls»Hcl (pH 7,5). 5 ul of 100 mM MgCb, 1 ul of 1 M ammonium sulfate. 10 ul 
of 1 M KCI. 1.25 ul of 4 mg/ml bovine serum albumin. 5 ul of 1 mM each of d(ACQT)TPs. 1 ul of pP]- 
dCTP (Amersham. 3000 Ci/mmole), 1.5 ul of RNase H (Bethesda Research Laboratories) and 2.3 ul (23 
units) of DNA polymerase I (New England Biolabs). The reaction was incubated for 1 h at 14* C and then 

20 for 1 h at room temperature. It was tenriinated by the addition of 4 ul of 0.5 M EDTA (pH 7.0) and extracted 
with an equal volume of phenol/chloroform/lsoamyl alcohol. The double-stranded cDNA was precipitated by 
the addition of 40 ul of 7.5 M ammonium acetate and 300 ul of absolute ethanol. The pellet was 
reprecipitated two times as described in step I. Rnally. the pellet of cDNA was rinsed In absolute ethanol. 
dried and dissolved in 25 ul of DEP-treated water. 

25 C, To fill In the ends of the cDNA so that chemically synthesized Iinl<ers could be (Igated to them, 7.7 

ul of DEP-treated water was added to the 25 ul of double-stranded cDNA. The final reaction volume was 50 
ul and included 5 ul of (0.5 M Tris«Hcl. pH 8.0. 60 mM MgCb and 250 mM KCI). 5 ul of 10 mM DTT. 5 ul 
of 1 mM each d(ACQT)TPs, 1.3 ul of 4 mg/ml BSA and 1 ul (5 units) of T4 DNA polymerase (Bethesda 
Research Laboratory). The reaction was incubated at 37* C for 30 min^and terminated by the addition of 2 

30 ul of 0.5 M EDTA (pH 7.5). The solution was heated for 10 min at 70* 0 and the nucleic acid precipitated 
by the addition of 20 ul of 7.5 M ammonium acetate and 150 ul of ethanol. The precipitate was collected, 
reprecipitated once, rinsed and dried as described In step I. The DNA was dissolved In 27.15 ul of TE. 

D. To protect internal EcoRI sites within the DNA frorh subsequent cleavage, the DNA was modified 
by EcoRI methylase. To 27.F7II of DNA was added 10 ul of 5X buffer (0.5 M Tris«Hcl. pH 8.0. 5 mM 

35 EDTA), 5 Ul of 4 mg/ml BSA, 5 ul of 100 uM SAM (S-adenosyl-L-methionine. Sigma) and 2,5 ul (50 units) 
EcoRI methylase (New England Biolabs). The reaction was Incubated at 37* C for 30 min and then heated 
at 70* C for 10 min. The methylated cDNA was precipitated by adding 20 ul of 7.5 M ammonium acetate 
and 150 ul of absolute ethanol. The precipitate was collected, reprecipitated twice, rinsed and dried as 
described In step A. 

40 E. EcoRI dodecamer linl<ers were kinased In a total volume of 50 ul as follows; 15 ul (1.5 mmoles) of 

linker solution (Collaborative Research). 5 ul of 10X kinase buffer (700 mM Tris.Hcl, pH 7.5. 100 mM 
MgClz, 50 mM DTT). 5 ul of 10 mM ATP. 22 ul of HaO and 3 Ul (15 units) of T4 polynucleotide kinase 
(New England Biolabs). The reaction was incubated at 37* C for 2 h. 

F. To iigate the kinased ^Rl linkers to the methylated cDNA. 10 ul of water was added to the dried 
45 cDNA from step D followed by 10 ul of linkers from step E, 1.5 ul of 10 mM ATP, 1.5 ul of 10X ligase 

buffer (0.5 M Tris»Hcl pH 8,0. 0.1 M Mgei2. 0.2 M DTT) and 2 ul of T4 DNA ligase (New England Biojabs). 
The total reaction of 25 ul was incubated at 14* C ovemlght. 

G. The cDNA, the T4 DNA ligase was inactivated by heating the reaction from step F for 1 5 min at 
70' C. To the mixture was added 70 ul of water, 10 ul of 10X TMN (100 mM TristHcl, pH 7.5, 100 mM 

50 MgCIa and 1.0 M NaCI) and 5 ul (100 units) of EcoRI. The 100 ul reaction was incubated at 37* C for 3 h to 
remove excess linkers from the cDNA and then phehoi/chlorofomn/isoamyl alcohol extracted. One-tenth 
volume of 5 M NaCI was added to the aqueous phase and the -trim'med" cDNA was fractionated from, 
excess linkers by gel filtration on a 0.7 X 18 cm Sepharose CL4B column. The elOtion buffer was 0.4 M 
NaCI, 10 mM Tris»Hcl. pH 8.0. 1 mM EDTA. The radioactivity of the fractions was determined and the peak 

5$ of cDNA pooled and precipitated one time with 2 volumes ethanol as described In step A then precipitated. 
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H. To prepare the Xgt10 vector for cloning the cDNA, 25 tiQ of xgtIO was digested with EcoRI. The 
final fBaction volume was 250 ul and contained IX TMN and 100 units of EcoRI, The enzyme digestion was 
carried out at 37" C for 2 h and was then extracted with chloroform. The linearized XgtlO was precipitated 
one time with ethanol as described in step A. The dried DNA was dissolved In 100 ul of TE. 

5 I To llgate the vector DNA to the cDNA. 12 ul (3 ug) was added to the ethanol-precipitated EcoRI- 

linked cDNA (step Q). The mixture was dried and 7 ul of water was added followed by 1 ul of 10X llgase 
buffer and 1 Ul of 10 mM ATP. The mixture was Incubated at 37* C for 5 min. chilled to 4* C and 1 ul of T4 
DNA llgase was then added. After Incubation at 14* C overnight, the ligation mixture was stored at -20* C. 
J. To package the recombinant DNA from step I into phage particles and plate them using E. coll 

70 strain C600Ahfl as a host, one-third (3 ul) of the ligation mixture was treated using Gigapack-plus according 
to the manufacturer's (Stratagene) Instructions. Two hundred thousand recombinant phage we^e obtained. 



3. Screening of the Snake Venom Gland cDNA Library 

76 

One thousand recombinant phage from the southern copperhead snake venom gland cDNA library were 
plated onto Luria agar using E, coll host strain CBOOAhfl according to the manufacturer's (Strategene) 
instructions, The transfer of phage"S"NA to nitrocellulose filters (phage lifts) and the subsequent hybridiza- 
tion of the filters with 32p-iabeled single stranded (first strand) cDNA probe were done essentially as 

20 described by Maniatis et al. This strategy was selected since fibrolase is an abundant protein in snake 
venom and, therefore, the mRNA encoding this enzyme is expected to also be abundant. 

The plaque lifts were done in duplicate, and 4he filters preannealed overnight at 42* C in 50% 
formamide, 5% BSC. 2x Denhardt's, 50 mM sodium phosphate, pH 6,5, 0.25% SDS and 100 ug/ml 
denatured salmon testes DNA. The filters were then hybridized with 250.000 cpm/ml of the 32p-iabeled 

26 stranded cDNA probe (boiled for 10 min in 0.4 N NaOH) made from the poly(A)+ RNA using the procedure 
described previously. Hybridization was done at 42*C overnight In preanneal solution that also contained 
10% (w/v) sodium dextran sulfate. Washing of the filters was done In 2x SSC. 0.1% SDS for 30. min at room 
temperature, followed by 30 min at 50'C. The autoradiogram was exposed with an intensifying screen at 
80 *C overnight. 

30 Sixty of the strongly hybridizing plaques were removed by coring and the phage eluted into PSB (100 
mM NaCI, 10 mM Tris. pH 7.5, 10 mM MgCfe, 500 mg/l gelatin). By plating appropriate dilutions of the 
phage stock eluate and screening by hybridization as described, an Isolated bacteriophage colony was 
obtained for six of the original plaque. 

Lambda DNA was Isolated from each clone and the EcoRI-EcoRI cDNA inserts were purified away from . 

35 the vector. 

Restriction digests' were done according to manufacturer's (New England Biolabs) instructions on the 
EcoRI fragments using PstI and Hindlll separately. The results of the restriction analyses are shown in 
Table 1. The restriction analyses show that of the 6 clones screened, there are 3 distinct classes. Two of 
these (SVG-48 and SVG-51) weiB completely sequenced. The sequences are shown In Figs 1 and 2, 
40 respectively. 

Tablet 



so. 



Clone 


Eco Rl 


PstI 


Hind III 




Fragments 


Fragments (kb) 


Fragments 




(kb) 




(kb) 


SVG-6 


1.1 


0.5-.0.6 + 


1.1 


SVG-8 


1.8 


0,^,0.6, 1,1 


0.4. 1,4 


SVG-20 


1.3 


0.5. 0.8 


1.3 


SVG-23 


1.8 


0.2, 0.5-. 1.1 


0.4.1.4- 


SVG-48 


1.8 


0.2. 0.6, 1,1 


0.4.1.4 


SVG-51 


1.8 


0.2, 0.5. 1.1 


0.4, 1.4 



55 

4. Intracellular Expression of Fibrolase Protein 



9 



EP0p0323722 (http://vwtfw.getthepat entxom/lo Qin.dog /$folev/Fetch/EP0003237 22.cpc^^^^ 



EP 0 323 722 A1 

The following describes the intracellular expression of the fibrolase protein in yeast under the control of 
the regulatable promoter ADH2-glyceraldehyde-3-phosphate-dehydrogenase (GAPDH or GAP) and GAPDH 
terminator. 

5 

A. Construction of a Yeast Expression Vector for Fibrolase: 

The scheme used to prepare the yeast expression vector is shown in Rgure 5. 

SVG 48 (or SVG 51) was digested with StuI and PstI and the resulting 717 bp Stul/PstI fragment was 

70 gel isolated. That fragment was ligated to the Hindlll/StuI and Psti/Sall synthetic adaptors shown in Figure 5. 
The resulting 742 bp Sall/Hindlil fibrolase gene encoding 231 amino acids was ligated to a 1341 bp ADR2- 
GAPDH promoter fragrSent (Figure 6). The resulting ligation product was ligated into the plasmid pPGAP-1 
which had been cut with Bam HI and Sail. pPGAPI is a yeast expression cassette vector which has a 
polyrestriction site linker between the SSPDH temiinator and a truncated GAPDH promoter region. The 

15 polyrestriction site contains the recognition sites for Nco l. EcoRI, and Sail, and the cassette is excisable as 
a BamHI fragment. The preparation of pPGAPI is describe"dTh EPO 0 164 556 and Travis. J., et al, J Biol 
CherTr(1985) 260(7):4384-4389. In both references pPGAPI is referred to pPGAP. The resulting plasmid was 
called PBRF248 (or pBRF251). 

pBRF248 (or pBRF251) was digested with BamHI and the resulting Bam HI cassette containing the 

20 ADH2-GAPDH promoter, fibrolase structural gene, and GAPDH terminator was gel purified and ligated In 
plasmid pAB24 which had been cut with Bam HI. pAB24 Is a yeast shuttle vector which contains the 
complete 2 u sequences (Broach, In: Molecular Biology of the Yeast Saccharomyces, 1:445. Cold Spring 
Harbor Press (1981)) and pBR322 sequences. It also contains the yeast URA3 gene derived from plasmid 
YEp24 (Botstein et al, Gene (1979) 8:17) and the yeast LEU2^ gene derived from plasmid pCI/l (described 

25 in European Patent Application publication no, EP01 16201). Insertion of the expression cassette was in the 
BamHI site of pBR322, thus interrupting the gene for bacterial resistance to tetracycline. The resulting 
piaTmid was designated pAB24F248 (or pAB24F251). 

Plasmid pAB24F248 (or pAB24F251) was transformed Into yeast strain S. cerevisiae 2150-2-3 Mate, 
adel. Ieu2'04 . [cir*] (strain from the collection of L. Hartwell, University of Washington. Seattle) and plated 

30 orTieu" 8% glucose, Transformants were streaked on leu" 8% glucose to obtain single colonies which were 
patched onto leu" 8% glucose plates. Cells from 8 patches were grown for -24 hr in leu"^8% glucose 
minimal media and then Inoculatetd (1:25 v/v) In YEP 1% glucose and grown for 96 hr at 30 *C. Samples 
(2.5 CD units) were collected for gel analysis at 24, 48, 72. and 96 hr. 

Cell samples were boiled in SDS sample buffer (0.0675M Tris«CI pH 6.8, 3% SDS, 10% glycerol) 

35 containing 50 mM DTT and run on 12.5% SDS-acrylamide gels. Coomassie staining of the gels showed a 
prominent band in the predicted 25.9 kd size range. This band accounted for -20% of the totel cell protein 
and was not found In pAB24 control extracts. It was seen in samples collected at each time point 

Western blot analyses were carried out on eight 48 hr samples of pAB24F248 and pAB24F251 
transformants using rabbit anti-fibrolase sera. The antisera reacted strongly with the 25.9 kd band In all 

40 eight samples. 

There were also lower molecular weight bands that reacted with the antisera, suggesting some 
degradation of the fibrolase in the yeast. 

45 B. Construction of Yeast Expression Vector Encoding Con-ected Native Fibrolase Sequence 

Using pAB24F251 as a starting plasmid an Internal expression plasmid was constructed which encodes 
the corrected native sequence (i.e., the authentic amino terminal sequence of mature southem copperhead 
fibrolase preceded by a Met residue [i,e., (Met) Gln-Gln-Arg-Phe-Pro-Gln-Arg..„] and the amino acid 
50 changes at positions 123 and 170 (described supra) of the mature protein). Sequence changes were made 
by conventional in vitro mutagenesis techniques. This plasmid was designated pAB24F751. 

S. cerevisiae cells transformed with pAB24F751 were observed to express a high level of protein 
corresponding in size to native fibrolase. Testing of cell lysates by the azocasein hydrolysis assay, however, 
showed no activity. 

56 

5. Constiruction of Plasmids For Secretory Expression of Mdture Fibrolase and Proflbrolase 
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. A series of plasmlds were made to direct the secretory expression of mature fibrolase or profibrolase 
(Profib). These plasmids are summarized in the following table. 



70 



75- 



20 



26 



30 



35 



40 



45 



50 



55 



Plasmid 


Vector 


Promoter 


Leader 


Fibrolase 


pKS308 


pAB24 


GAP 


aF 


Mature Fibrolase 


pKS311 


pAB24 


GAP 


oF 


Profib 


pKS313 


pAB24 


ADH2/QAP 


aF 


Pro(KR)fib 


pKS3l4 


pAB24 


GAP 


aF 


Pro(KR)fib 


pKS316 


pAB24 


ADH2/GAP 


aF(AKR) 


Pro(KR)fib 


pKS317 


pAB24 


GAP 


aF(AKR) 


Pro(KR)flb 


PKS359 


pAB38 


GAP 


oF 


Profib 


pKS362 


pAB38 


GAP 


aF 


Pro(KR)fib 


pKS365 


pABGB 


GAP 


aF(AKR) 


Pro(KR)flb 



GAP = glyceraldehyde-3-phosphate-dehydrogenase 

ADH2 = alcohol degydrogenase 2 

oF = S. cerevlsiae alpha factor 

KR = Lys-Arg processing site 

AKR = minus Lys-Arg processing site 



The fibrolase gene of pKS308 was derived from pAB24F751 and thus encodes the corrected native 
sequence. The profibrolase gene used in pKS3l1 was derived from Fib 51 and thus does not have the 
amino acid changes of the con-ected native sequence but does include the 18 amino acid C-temninal 
extension of the Fib 51 sequence. A K. lactfs counterpart of pKS3ll (designated pKS359) was made using 
the K. lactis vector pAB38 instead of pAB24. pAB38 Is a derivative of pUC18 (Yanisch-Pen-on et a!.. Gene 
(1985) ?3!l53) and was prepared as follows. pUC18 was cleaved with Hindlll and a 1.2 l<b HIndlll fragment 
containing the URA3 gene (Botstein et al.. Gene (1979) 8:17) was llgated Into pUC18. Further, a PKDI 
(Falcone et al.. Plasmid (1986) 15:248) Eco RI insert from Kluyveromyces drosophtlarum was ligated into the 
•EcoRI site of pUC18, Finally, the vector was cleaved with Narl/Hindlll (partial), filled in with Klenow, and 
religated to yield pAB38. The S. cerevisiae o-factor leader sequence and its incorporation into vectors is 
described in EPO Publlcation"ll6,201. The ADH2/GAP promoter and Its Incorporation into vectors is 
described In EPO Publication 164,556. 

Expression of the pKS3l1 and pKS359 constructs in transformed S. cerevlsiae and K. lactis yielded a 
substantial amount of immunoreactlve species having the size of profibrolase, it thus appeared that a 
substantial portion of profibrolase was not being processed into the mature protein. Constructs were thus 
made In which a Lys-Arg processing site was introduced using synthetic oligonucleotides at the pro-mature 
junction which should bo recognized by the yeast KEX2 -encoded protease, the same enzyme that carries 
out the cleavage at the junction of the a-factor leader. Figure 7 shows the sequence generated by this 
mutagenesis process on a pAB24F751 -derived a-factor profibrolase fusion construct. This sequence, being 
•thus derived, includes the amino acid changes of the corrected native sequence but lacl<s the 18 C-terminal 
residues of the Rb 51 profibrolase sequence. The segment coding for those C-termlnal residues may be 
Introduced if desired. Such constructs were made using the GAP promoter (pKS314 in the table) and the 
ADH2/GAP promoter (pKS313 in the table). A corresponding construct was made based on the K. lactis 
vector pAB38 (pKS362 in the table). 

pKS314 and pKS362 were introduced into S. cerevisiae and K. lactis strains and the production and 
secretion of mature fibrolase from the transformants was analyzed by SDS gel electrophoresis and Westem 
blot analysis. Both secreted mostly mature fibrolase. 

pKS314 and pKS362 each contain two Lys-Arg processing sites, one at the a-factor-pro junction and 
another at the pro-mature junction. In order to mininpize the possibility of there being insufficient KEX2 
protease to process all of the profibrolase being produced, a synthetic oligonucleotide adapter was used to 
remove the Lys-Arg site at the a-factor-pro junction. These constructs are designated pKS316. pKS317. and 
pKS365. 

The fibrolase products produced using the expression vectors listed in the above table were tested for 
fibrolase activity by either the azocasein hydrolysis assay or the fiuorometric assay. 

The azocasein hydrolysis assay is canied out as follows. A solution of 2.5 g azocasein in 50ml 1% 
NaHCOa is prepared with stirring and warming to 60'C. The resulting solution Is then dialyzed overnight 
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against 4 1 of 1% NaHCOa. A 50 ul portion of culture supematent concentrated and dialyzed against 50 mM 
HEPES/pH 7.5 by ultrafiltration Is then added to 1 ml of azocasein solution and the mixture is Incubated for 
approximately 30 minutes-24 hours. One ml of 1.16 M perchloric acid is then added, the mixture is 
centrifuged for 10 minutes, and the absorbance of the supernatant Is read at 390-440 nm against an 
5 appropriate control. 

For the ftuorometric assay, a fluorescein isothlocyanate casein (FITC-casein) assay reagent is prepared 
by mixing 1 part FITC-casein (5 mg/ml), 1 part 2X buffer (.IX buffer = 50 mM HEPES-Na*. 150 mM NaCI. 
0.5 mM ZnCIa, pH 7.5). Recombinant fibrolase product and snake venom fibrolase standards are diluted in 
IX buffer containing 100 ttg/ml BSA as a stabilizer. 

10 Assay reagent, 80 ul Is brought to 37* and 20 ul of fibrolase sample or standard is added. After a 90 
minutes incubation at 37', the reaction is stopped by the addition of 300 iit of 7% TCA. After 30-60 
minutes at 4*. the mixture Is centrifuged for 6 minutes at 12.000 rpm. A 200 ul portion of the supernatant is 
removed and added to.1 ml of 0.5 M Tris-HCI, pH 8.0 and the fluorescence measured at an excitation 
wavelength of 490 nm and an emission wavelength of 525 nm, and the value compared to the fibrolase 

75 standards. For determination of non-fibrolase protolytic activity, assays were done in the presence of 10 
mM EDTA, an inhibitor of fibrolase. 

All of the recombinant fibrolase products produced using the expression vectors listed in the table 
exhibited fibrolase activity in these assays. 

Modifications of the above described modes for carrying out the invention that are obvious to those of 

20 skill in the fields of enzyme chemistry, genetic engineering and/or medicine are intended to be within the 
scope of the following claims. 



Claims 

25 

1. Recombinant DNA encoding a Crotalidus fibrolase. 

2. The recombinant DNA of claim 1 wherein the fibrolase is southern copperhead fibrolase. 

3. The recombinant DNA of claim 2 wherein the fibrolase comprises at least (a) amino acids 192-393 of 
the Fib 48 sequence shown in Rgure 4, (b) amino acids 186-387 of the Fib 51 sequence of Figure 4 or (c) 

30 amino acids 185-387 of the Fib 51 sequence of Figure 4 with a Pro to Ser substitution at amino acid 307 of 
the Fib 51 sequence of Figure 4 and a Val to Met substitution at amino acid 354 of the Rb 51 sequence of 
Figure 4. 

4. The recombinant DNA of claim 2 wherein the fibrolase is an analog of the fibrolase comprising at 
least (a) amino acids 192-393 of the Fib 48 sequence shown in Rgure 4 (b) amino acids 186-387 of the Rb 

35 51 sequence of Rgure 4 or (c) amino acids 185-387 of the Fib 51 sequence of Figure 4 with a Pro to Ser 
substitution at amino acid 307 of the Rb 51 sequence of Rgure 4 and a Val to Met substitution at amino 
acid 354 of the Rb 51 sequence of Rgure 4. 

5. An expression vector for expressing a Crotalidus fibrolase comprising the DNA of claim 1 and 
expression control sequences that are operably linked to said DNA and effective In directing expression of 

40 said DNA. 

6. An expression vector for expressing southern copperhead fibrolase comprising the DNA of claim 2. 3 
or 4 and expression control sequences that are operably linked to said DNA and effective in directing 
secretory expression of said DNA, 

7. The expression vector of claim 6 wherein the expression vector is a yeast expression vector. 

46 8. The expression vector of dalm 7 wherein the expression control sequences Include a yeast a-factor 
leader sequence for directing secretion of the fibrolase. 

9. The expression vector of claim 8 wherein the DNA encodes the profibrolase sequence shown in 
Rgure 7 and in which there is a Lys-Arg processing site at the pro-mature protein sequence. 

10. A recombinant microorganism or cell containing the expression vector of claim 5 and being capable 
50 of producing a Crotalidus fibrolase. 

1 1 . A recombinant microorganism or ceil containing the expression vector of claim 6 and being capable 
of producing southern copperhead fibrolase. 

12. A recombinant yeast containing the expression vector of claims 7, 8 or 9. 

13. A method of producing a Crotalidus fibrolase comprising growing the microorganism or ceil of claim 
55 10 under conditions whtoh pemnlt the expression of the fibrolase. 

14. A method of producing southern coppertiead fibrolase comprising growing the recombinant yeast 
ceil of cl§lm 12. 
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15. Recombinant homogeneous Crotalidus fibrolase free of snake proteins with which It is associated In 
venom. 

16. The recombinant Crotalidus fibroiase of claim 15 wherein the fibrolase Is southern copperhead 
fibrolase. 

5 17. The recombinant southem copperhead fibroiase of claim 16 comprising at least (a) amino acids 
192-393 of the Fib 48 sequence shown In Rgure 4, (b) amino acids 186-387 of the Fib 51 sequence of 
Rgure 4 or (c) amino acids 185-387 of the Rb 51 sequence of Figure 4 with a Pro to Ser substitution at 
amino acid 307 of the Rb 51 sequence of Figure 4 and a Vai to Met substitution at amino acid 354 of the 
Rb 51 sequence of Figure 4, or an analog thereof. 

10 18. A pharmaceutical composition for treating a thromboembolic condition comprising the recombinant 
Crotalidus fibrolase of claim 15. 16 or 17 admixed with a phamiaceutlcally acceptable injectable carrier. 

19. A method of treating a patient for a thromboembolic condition comprising administering a 
therapeutically effective amount of the recombinant Crotalidus fibrolase of claim 15. 16 or 17 to the patient 
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FIB48 MAP 



\ Neu eingsrsicht / Newiy iiied 
Nouyeilerr.sntd225!Hl 



.1—1. 



.11. 



SACI 



PSTI 



SPHX 
HINDS 



SACI 
MAEI 



AFL2 PSTI 
AVA3 PSTI 



.ILL 



BSTE2 
NDBI 
STUI 



IJ 1. 



SPEI 
MDEI 
SSPI 



SSPI 



J-_l. 
BALI 



ASU2 



2 TCAGGTTGACTTGAAAGAAGGAAGAGATTGCCTGTCTTCCAGCCAAATCCAGCCTCCAAA 
AGTCCAACTGAACTTTCTTCCTTCTCTAACGGACAGAAGGTCGGTTTAGGTCGGAGGTTT 

MetlleGlnValLeuLeuValThrlleCysLeuThrAlaPheProTyrGlnGlySerSer 
62 ATGATCCAGGTTCTCTTGGTGACTATATGCTTAACAGC'CTTTCCTTATCAftGGGAGCTCT 
^ACTAGGTCCAAGAGAACCACTGATATACGAATTGTCGAAAAGGAATAGTTCCCTCGAGA 

115 SACI 

IlelleLeuGluSerGlyAsnValAsnAspTyrGluValValTyrProArgLysValThr 
122 ATAATCCTGGAATCTGGGAACGTGAATGATTATGAAGTAGTGTATCCACGAAAAGTTACT 
TATTAGGACCTTAGACCCTTGCACTTACTAATACTTCATCACATAGGTGCTTTTCAATGA 

180 PSTI 

AlaValProArgGlyAlaValGlnProLysTyrGluAspAlaMetGlnTyrGluLeuLys 
18 2 GCAGTGCCCAGAGG AGCAGTTCAGCCAAAGTATGAAGATGCCATGCAATATGAATTGAAA 
CGTCACGGGTCXCCTCGTCAAGTCGGTTTCATACTTCTACGGTACGTTATACTTAACTTT 

ValAsnGlyGluProValValLeuHisLeuGluLysAsnLysGlyLeuPheSeirGluAsp 
242 GTGAATGGAGAGCCAGTQGTCCTTCACCTGGAAAAAAATAAAGGACTTTTTTCAGAAGAT 
CACTTACCTCTCGGTCACCAGGAAGTGGACCTTTTTTTATTTCCTGAAAAAAGTCTTCTA 

TyrSerGluThrHisTyrSerProAspGlyArgGluIleThrThrTyrProLeuValGlu 
302 TACAGCGAGACTCATTATTCCCCTGATQGCAGAGAAATTACAACATACCCCCTGGTTGAG 
ATGTCGCTCTGAGTAATAAGGGGACTACCGTCTCTTTAATGTTGTATGGGGGACCAACTC 

AspHisCysTyrTyrHisGlyArglleGluAsnAspAlaAspSerThrAlaSerlleSer 
•362 6ATCACTGCTATTATCAT6GACGCATCGAGAATGATGCTGACTCAACTGCAAGCATCAGT 
C*AGTGAC6ATAATAGTA0CTG06TAGCTCTTACTA0GACTGAGTTGACGTTCGTAGTCA 

AlaCysAsnGlyLeuLysGlyHisPheLysLeuGlnGlyGluMetTyrLeuIleGluPro 
422 GCATGCAACGGTTTGAAAGGA(»TTTCAAGCTTCAA6GGGAGA1GXACCTTATTGAACCA 
CGTACGTTGCCAAACTTTCCTGTAAAGTTC6AA6TTCCCCTCTACATGGAATAACTTGGT 

422 SPHI, 449 .HIND3 

LeuGluLeuSerAspSerGluAlaHisAlaValTyrLysTyrGluAsnValGluLysGlu 
482 TTGGAGCTTTCCGACAGTGAAGCCCATGCAGTCTACAAATATGAAAATGTAGAAAAAGAG 
AACCTCGAAAGGCTGTCACTTCGGGmCGTCAGAXGTTTATACTTTTACATCra 
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AspGliiAlaProLysMetCysGlyValThrGlnAsnTrpGluSerT^ 
542 <3ATGAG6CCCCCAAAAax:T6TG666TAACCCA6i^IT6GGAATCATAI6A6CC^ 
CTACTCCG6GGGTTXTACACACCCaVXTGGGTCTTAACCCTTA6XATA(^^ 

565 BSTE2, 5B5 NDEI 

LysAlaPheGlnLeuAsnLeuThrPrcXSluGlnGlnGlyPheProGlnArgTyrValGlu 
602 AAQGCCTTTCAfiTTAAATCTTACTCCTGAACAACAAGGATTCCCC(»AAGATACGTT^ 
TtCCGGAAAGTCAATTTAGAATGAGGACTTGTTGTTCCT^ 

603 STUI 

LeilVallleValAlaAspHisArgMetTyrThrLysTyrAsnGlyAspSerAspLysIle 
662 CTTGTCATAGTTGCGGATCACAGAATGTACACSAAATACAATCGTGATTCAGATAAG^ 
GAACAGTATCAACGCCTAGTGTCTTACATCTGCTTTATGTTACCACTAAGTCTATT 

ArgGlnTrpIleTyrArgMetValAsnThrlleAsnGluIleTyrArgP roLe uAsnlle 
722 AGACAATGGATATATCGAATGGTCAACACTATAAATGAGATTTACAGACCTT^ 
TCaX3TTACCTATATAGCTTACCAGTTGTGATA!5JTACTCTAAATGTC 

Q 776 SSPI 

GlnPheValLeuValGlyLeuAspIleTrpSerLysLysAspLeuSerThrValThrSer 
782 CAATTCGTACTGGOTOGCCTAGACATTTGGTCCAAGAAAGATTTGAGTACCGTG^ 
GTTAAGCATGACCAACCGGATCTGTAAACCAGGTTCTTTCTAAACT 

ValSerHisAspThrLeuAlaSerPheGluAsnTrpArgGlnThrAspLeuLeuAsnArg 
842 GTATCACATGATACTTTGGCCTCATTTGAAAACTGGAGAC»GAaVGATTTGCTC 
CATAGTGTACTATGAAACCGGAGTAAACTTTTCACCTCTGTC^ 

LysSerHisAspAsnAlaGliiLeuLeuThrAlalleValPheAspGluGlyllelleGly 
902 AAAAGTCATGATAATGCCCAGTTACTCACKGCCATIGTCTTCGATCAAGGAAT^^ 

TTTTCAGTACTATTACGGGTCAATCAGTGCCGGTAACAGAAGCTACTTCCTTAAT^^ 

ArgAlaProLeuAlaGlyMetCysAspProMetPheSerValGlylleValGluAspHis 
9 62 AGAGCTCCCCTAGCCGGCATGTGTGACCCGATGTTTTCTGTAGGAATTCTTGAGGAT^ 
TCTCGAGGGGATCGGCCGTACACACTGGGCOa^CAAAAGACATCCTTAACAACTCCTA^ 

963 SACI, 974 NAEI 

^ SerAlalleAsnLeuLeuValAlaLeuThrMetAleOiisGluLeuGlyHisAsnLeuGly 
C 1022 AGTGCAATAAATCTTTTGGTTGCACTTACAATGGCCCATGAGCT^ 

TaCGTTATTTAGAAAACCAACGTGAATGTOaVCCGGGTACT 

MetAspHisAspGlyAsnGlnCysHisCysGlyAlaAsnSerCysValHetAlaAspThr 
1082 ATGGATCATGATGGAAATCAGTGTCATTCCGGTGCTAACXCGXGCGTTATGGCTGA^ 
TACCTAGTACTACCnTTAGTCACAGTAACGCCACGATTC 

LeuSerAsnGlnProSerLysLeuPheSerAspCysSerLysLysTyxTyrGlnLysPhe 
1142 CTAAGTAATCAACCTTCCAAACTATTCAGCGATTGTAGTAAGAAATACTATCAGAAGTTT 
GATTCATTAGTTGGAAGGTTTGATAAGTCGCTAACATCATTCTTTATGAl^^ 

LeuLysValLysAstxProGlnCysIlel^uAsnLysProLeiiArgThrAspThrValS 
1202 CTTAAGGTTAAAAACCCACAATGCATTCTCAATAAACCCTTGAGAACAGATACT^ 
•GAATTCCAATTTTTGGGTGTTACGTAAGAGTTATTTGGGAACTCTTC 

1202 AFL2, 1222 AVA3 

ThrProValSerGlTAsnGluLeuLeuGluAlaOP 

1262 ACTCCAGTTTCTGGAAATGAACTTrEGGAGGCGTGAGAAGAAIGTGACTGTGGCTCTCCT 
TGAG(3Ta«AGACCTTl»CTTGAAAACCTC03CACrCTTCT^^^ 



1320 PSTI 
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1322 GCAGTCTGCAGCAACAGGCAGTGTGTTGATCTGACTACAGCC^ 
CGTCAGACGTCGTTGTCCGTCACACAACIACACTGATGTCGGAI^ 

1327 PSTI 

1382 TateTCTa«»TTTGATCTTGGAGATCCTTCTTTCAGAAGGTT^ 
Q AAGAGAGTCTAAACTAGAACCTCTAGGAAGAAAGTCTTCCAAACCGAAGGGACA^ 

1442 AAGAGACCCATCTGCCTCCATCCTACTAGTAAATCACTCTTAGCTTT^ 

TTCTCTGGGTAGACGGACGTAGGAgPGAOX^TTlJAGTGAGAATCGAAAGTAT^^ 

1466 SPEl, 1489 NDEI 

1502 ACTTCTGCAATATTTCTTCTCCATATTTAATCTG!mACCTTTTGCTGTAAT(^^ 

TGAAGACGOTATAAAGAAGAGGTATAAATTAGACAAATGGAAAACGACATTAGTTTQGAA 

1510 SSPI 

1562 TTCCCACCACAAAGCTCTATGGGCATGTACAACACCAACGGCOT^ 

AAGGGTGGOXSTTTCGAGATACCCGTACATGTTGTGGTTGCCGAATAGACGACAGT^^ 

1622 AAAAATGGCCATTTTACCGTTTGCCAAAGCACATTTAATGCAACaAGTTCTX;^ 
TTTTTACCGGTAAAATGGCAAACGGTTTCGTGTAAATTACGTTGTTCAAGACX3 

1627 BALI 

1682 GCTGGTGTATTCGAAGTGAATGTTTACTCTCCCAAAATTTCATGCTGGCTT^ 
1691 ASU2 

1742 GTAGCTGCTTCCGTCAATAAACTAACTATTCTCATTCA AAAAAA A 
CATCGACGAAGGCAGTTATTTGAnGATAAGAGXAAGTTITTrTT 
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FIB51 MAP 



• ■ ■ ■ I. ' - I I I I ■ .-^ 

I Neu eingereicht / N-cvvly Viiod i 



■'SACI AHA3 SPHI BSTE2 SSPI NARI 

HIND3 NDEI 
* STUI 

.1 I 1 1 LI I l_L 

SSPI 



BCLI PSTI SPEI BALI 

AVA3 PSTI NDEI . ASU2 



ValThrlleCysLeuAlaAlaPheProTyrGlnGlySerSerllelleLeuGluSerGly 
2 GTAACTATATGCTTAGCAGCTTTTCCTTATCAAGGGAGCTCTATAATCCOX^GAATCTGGG 
CATTGATATACGAATCGTCGAAAAGGAATAGTTCCCTCGAGATATTAGGACCTTAGACrc 

C 37 SACIr 

AsnValAsnAspTyrGluValValTyrProArgLysValThrProValProArgGlyAla 
6 2 AACGTTAATGATTATGAAGTAGTGTATCCACGAAAAGTCACTCCAGTGCCCAGAGGAGCA 
TTGCAATTACTAATACTTCATCACATAGGTGCTTTTCAGTGAGGTCACGGGTC^ 

ValGlnProLysTyrGluAspAlaMetGlnTyrGluPheLysValAsnGlyGluProVal 
122 GTTCAGCCAAAGTATGAAGATGCCATGCAATATGAATTTAAAGTGAATGGAGAGCCAGTC 
CAAGTCGGTTTCATACTTCTACGGTACGTTATACTTAAATTTC^ 

158 AHA3, 

ValLeuHisLeuGluLysAsnLysGlyLeuPheSerGluAspTyrSerGluThrHisTyr 
1B2 GTCCTTCACCTGGAAAAAAATAAAGGACTTTTTTCAGAAGATTACAGC^^ 
CAGGAAGTGGACCTTTTTOTATTTCCTGAAAA^ 

SerProAspGlyArgGluIleThrThrTyrProl^uValGluAspHisCysTyrTjTHis 
242 TCCCCTGATGGCAGAGAAATTACAACATACCCCCTGGTTGAGGATCACTGCTATTATCAT 
AGGGGACTACCGTCTCTTTAATGTIGTATGGGGGACCAACTCCTA^ 

C GlyArgileGluAsnAspAlaAspSerThrAlaSerlleSerAlaCysAsnGlyLeuLys 
302 GGACGCATCGAGAATGATGCllGACTa^TGCAAGCATCAGT^ 

CCTGCGTAGCTCTTACTACGACTGAGTIGACGTTCGTAGTCAaSTACGTTCCCA^ 

' 344 SPHI, 

GlyHisPheLysLeuGlnGlyGluMetTyrLeuIleGluProLeuGlu LeuS erAspSer 
362 GGACATTTCAAGCTTCAAGGGGAGATGTACCTTATTGAACCGTTGGA^ 

CCTGTAAAGTTCGAAGTTCCCCTCTACATGGAATAACTTGGCAACCTCGAAAGGCT^^ 

371 HIND3, 

GluAlaHisAlaV9lTyrLysTyrGluAsaValGluLysGluAspGluAlaProLy$Met 
422 gaagcccatg»gtcta(:aaatatgaaaatgtagaaaaagaggatgaggcccccaaaatc 
cttcgggtacgtcagatgtttatacttttacatctttttctcctactc^ 

CysGlyValThrGlnAsnTrpGluSerTyrGluProlleLysLysAlaPheGlnLeuAsn 
482 TGTGGGGTAACCCAGAATTGGGAATCATATGAGCCCATCAAAAAGGCCTTT^ 
AaCCCCATTGGGTCTTAACCCTTAGTATACTCmSTAGTTTrTCC^ 

•Mt 

487 BSTE2, 507 HDEI/ 525 STUI, 

FIG. 2-1 
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LeuThrProGluGlnGlnArgPheProGlnArgTyrValGlnLeuVallleValAlaAsp 

542 cttactcctgaacaacaaaggttcccccaAagatatgttcagcttgtcatagttg^ 
gaatgaggacttgttctttccaagggggtttctatacaagtosaacag^ 

HisArgMetTyrMetLysTyrAsnAsnAspSerAsnLeuIleArgGlnTrpValHisGln 
6t)2 CACAGAATGTACATGAAATACAATAATGATTCAAATTTGAT^^ 

GTGTCTTACATGTACTTTATGTTATTACTAAGTTTAAACTATTCT6TTACCCATGTAGTT 

IleValAsnThrlleAsnGluIleTyrArgProLeuAsnlleGlnPheThrLeuValGly 
662 ATTGTCAACACTATAAATGAGATTTACAGACCTTTGAATATiCAATTCACACTGGTTGGC 
TAACAGTTGTGATATTTACTCTAAATGTCTGGAAACTTATAAGTTAAGTGTGACCAACCG 

698 SSPI, 

LeuGluIleTrpSerAsnGlnAspLeuIleThrValThrSerValSerHisAspThrLeu 
722 CTAGAAATTTGGTCOACCAAGATTTGATTACCGTGACATCAGTATC^CATGATACTT^ 
GATCTTTAAACCAGGTTCGTTCTAAACTAATGGCACTGTAGTCATAGTGTACTATGA^ 

C AlaSerPheGlyAsnTrpArgGluThrAspLeuLeuArgArgGlnArgHisAspAsnAla 
782 GCCTCATTTGGAAACTGGAGAG AGACAGACTTGCTAAGGCGCCAAAGACATGATAATGC^ 
CGGAGTAAACCTTTGACCTCTCTCTGTCTGAACGATTCCGCGGTTTCTGTACrATT^ 

819 NARI, 

GliiLeuLeuThrAlalleAspPheAspGlyAspThrValGlyLeuAlaTyrValGlyGly 
842 CAGTTACTCACGGCCATTGACTTTGATGGAGACACTGTAGGATTGGCTTATGTGGGCGGT 
GTCiUVTGAGTGCCGGTAACTGAAACTACCTCTGTGACATCCTAACCGAATACACCCGCC^ 

MetCysGlnLeuLysHisProThrGlyVallleGlnAspHisSerAlalleAsnLeuLeu 
• 902 ATGTGCCAACTGAAGCATCCTACAGGAGTTATCCAGGATCATAGTGCAATAAATCTTTTG 
TACACGGTTGACTTCGTAGGATGTCCTCAATAGGTCCTAGTATCACGTTATTTAGAAAAC 

ValAlaLeuThrMetAlaHisGluLeuGlyHisAsnLeuGlyMetAsnHisAspGlyAsn 
962 GTTGCACTTACAATGGCCCATG AGCTGGGTCATAATCTGGGCATG AATCATG ATGG AAAT 
CAACGTGAATGTTACCGGGTACTCGACCCAGTATTAGACCCGTACTTAGTACTACCTTTA 

GlnCysHisCysGlyAlaAsnSerCysValMetAlaAlaValLeuSerAspGlnProSer 
1022 CAGTGTCATTGCGGTGCTAACTCGTGCGTCATGGCTGCTGTGCTAAGTGATCAACCCTCC 
^ GTCACAGTAACGCCACGATTGAGCACGCAGTACCGACGACACGATTCACl^ 

1069 BCLI, 

i LysLeuPheSerAspCysSerLysLysAspTyrGlnThrPheLeuThrValAsnAsnPro 
1082 AAACTATTCAGCGATTGTAGTAAGAAAGACTATCAGACGTTTCTTACGGTTAAT^^ 
TTTGATAAGTCGCTAACATCATTCTTTCTGATAGTCrcCAAAGAATG 

GlnCysIleLeuAsnLysProLeuArgThrAspThrValSerThrProValSerGlyAsn 
1142 CAATGCATTCTCAATAAACCCTTGAGAACAGATACTGTTTCAACTCCAGTTTCTGGAAAT 
GTTACGTAAGAGTTATTTGGGAACTCTTGTCTATGACAAAGTTGAGGTCAAAGACCTOT 

1144 AVA3, 

GluLeuLeuGluAlaOP 
1202 GAACTTTTGGAGGCGTCAGAAGAATGTGACTGTGGCTCTCCTGCAGTCTGCAGC^ 
CTTGAAAACCTCCGCACTCTTCTTACACTBACACCGAGAGGACGTCAGAra 

1242 PSTI, 1249 PSTI, 

1262 CAGTGTGTTGATGTGACTACAGCCTAATAATCAACCTCTGGCTTCTCTC^^ 

GTCaCACAACTACACTGATGTCGGATTATTAGTTGGAGACCGAAGAGAGTCTAAACTAGA 

13 22 TGGAGATCCTTCTTTCAGGAGGTTTGGCTTCCCTGTAGTCCAAAGAGACCC^^ 
ACCTCTAGGAAGAAAGTCXZTCCAAACCGAAGGGAOlTCaGGTTTCTCTGGGTA^ 
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13 82 CATCCTACTAGTAAATCACTCTTAGCTTTCATATGGAATCTA ACTTCTGCAATATTTCTT 
GTAGGATGATCATTTAGTGAGAATCGAAAGTATACCTTAGATTGAAGACGTTATA 

1388 SPBI/ 1411 NDEI, 1432 SSPI/ 

1442 GTCCATATTTAATCIGTAATCAAACCTTTTCCCACCACAAAGCTCTATGTG^^ 

GAGGTATAAATTAGACATTA6TT1SGAAAAGGGTGGTGTTTC6AGATACACGTACATGTT 

1502 CACCAACGGCTTATCTGCTGXCAAGAAAAAAAATGGCCATTTTACCGTTTGC^ 
GTGGTTGCCGAAXAGAOGACAGTTCrrrTTTTTACC 

1535 BALI, 

1562 ATTTAATGCAACAAGTTCTGCCTTTTGAGCTGGTGTATTCGAAGTGAATGTTTA 

TAAATTACGTTGTTCAAGAO^GAAAACXCGACCACATAAGCTTCACTTACAAATGAGAG^ 

1599 ASU2, 

1622 CaAAATTTCAlGCTGGCTTTCCAAGAOXSTAGCTGCTTCCGT^ 

GTTTTAAAGTACGACCGAAAGGTTCTACATaSAOGAAGGCAGTTAT^ 

1-682 ATTAAAAAAAAAAAA 



FIG. 2-3 
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FIG. 3-1 



Fib48 
FibSl 

Fib48 
FibSl 

Fib48 
FibSl 

Flb48 
FibSl 

Fib48 
FibSl 

Fib48 
FibSl 

Fib48 
FibSl 

Fib48 
FibSl 

Fib48 
FibSl 

Fib48 
FibSl 

Fib48 
FibSl 



70 80 90 100 110 120 

AAOXaATCCAGGTTCTCTTGGTGACTATATGCTTAACAGCT^ 

* * 

GGXAAGllATATGCTIAGCAGCTTTTCCTTATCAAGGGAGCTC 
10 20 30 40 ■ 

130 140 150 160 170 180 

TATMTCCTGGAATCTGQGAACGTGAATGATTATGAAGTAGTGTATC^^ 

★ ♦ 
TATAATCXTGGAATCTGGGAACGTTAAXGATTATGAAGTAGTGTATCCACGAAAAGTCAC 
50 60 70 80 90 100 . 

190 200 210 . 220 230 240 

TCCAGTGCCCAGAGGAGCAGTTCAGCCAAAGTATGAAGATGCCATC 

* * 
TCCAGTGCCCAGAGGAGCAGOrraiGCCAAAGTATGAAGATGCCATGCAATATGAATT^ 
110 120 130 140 150 160 

250 260 279 280 290 300 

AGTGAATGGAGAGCXIAGTGGTCCTXCACCaX^GAAAAAAATAAAQGACTTTTTl^ 

AGTGAATGGAGAGCCAGTGGTCCTTCACCTGGAAAAAAATAAAGGACTTTTTTCAGAAGA 
170 180 190 200 210 220 

310 320 330 340 350 360 

TTACAGOGAGACTCATTATTCCCCTGATGGCAGAGAAATTACAACATAC^^ 

TTACAGCGAGACTCATTATTCCCCTGATGGCAGAGAAATTACAACATACCCCCTGG 

230 240 250 260 270 280 

370 380 390 . ' 400 410 420 

GGATO^CTGCTATTATCATGGACGOVTCGAGAATGATGeTGACTC^ 

GGATO^CTGCXATTATCATGGACGCATCGAGAATGATGCTGACTCAACTGC^ 

290 300 310 320 330 340 

430 440 450 460 470 480 

TGC^TCC^AGGGTTTGAAAGGAOlTTTCAAGCrTCAAGG^ 

TGCATGCAACGGTTTGAAAGGACATTTC»AGCTTC»AGGGGAGATGTACCTO^ 

350 360 370 380 390 400 

490 500 510 520 530 540 

ATOX^GAGCTTTCQGACAGTGAAGCCCATGCAGTCTACAAATATGAAAATGTAGAAi^ 

GTTC6AGCTTTCCGACAGTGAAGCCCATGCAGTCTACAAATATGAAAATGTAGAAAAAGA 
410 420 430 440 450 460 

550 560 570 580 590 600 

GGATGAGGCCCXCAAAATGTGTGGGGTAACCCAGAATTGGGAATCAT^ , 

GGATGAGGCCCCCAAAATGTGTGGGGTAACCCAGAATTGGGAAIO^TATC^ 

470 480 490 500 510 520 

610 620 ' 630 640 650 660 

iVAAGGCCTrtCAGTTAAATCTTACTCCTGAACAACA^ 

AAAGGCCTTTCAGTTAAATCr^CTCCTGAACAACAAAGGTTCC^^ 

540 550. 560 570 580 



530 



720 



670 680 690 700 710 

GCTTGTO^TAGTTGCGGATCACAGAATGTACACGAAATACAA 

^ f( Hit * 

GCTTGTCATAGTTGCAGATCACAGAATGXAOVTGAAATACAATAATGArrCAAAT^ 
590 ISOO 610 620 630 640 
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730 740 750 760 770 780 

Flb48 AAGACAATGGATATATCGiUVIX^GTCAACACTATiU^TGAGATTTACAGAC^^ 

* * ♦ * 

FibSl AAGAC^ATGGGTACATCAAATTGTCAACACTATAAATGAGATl^ 

650 660 670 680 690 700 

790 800 810 820 830 840 

Fib4 8 TCAATTCGTACTCGTTGGCCTAGACATTTGGTCCAA^ 

FlbSl TCAATTCAOVCTGGTTGGCCTAGAAATTTGGTCCAACCAAGATO 

710 720 730 740 750 760 

850 860 870 880 890 900 

Fib48 AGTATCACATGATACTTTGGCXTCATTTGAAAACTGGAGACAGACAGATTTCCT^ 

★ * ★ * ** 

FibSl AGTATCACATGAXACXTTGGCCTCATTTGGAAACTGGAGAGAGACiU^ACTTGCT 

770 780 790 800 810 820 

910 • 920 930 940 950 960 

Fib48 CAAAAGTCATGATAATGCCCAGTTACXCACGGCCATTGTCTTCGATGAAGGAATTATAG^ 

******* 

* Fib51 CCAAAGACATGATAATGCCCA6TTACTCACGGCCATTGACTXTGATGGM 

830 840 850 860 870 880 

970 980 ' 990 1000 1010 1020 

Flb48 AAGAGCTCCCCTAGCCGGCATGTGTGACCCGATCTTTTCTGXAGGAATTGTTGA^ 
*** **♦* * * * * * ** 

Fib51 AXXGGCXXAXGIGGGCGGXAXGTGCCAACXGAAGCAXCCXACAGGAGTXAXCCAGGAXCA 
890 900 910 920 930 940 

1030 1040 1050 1060 1070 1080 

,Flb48 XAGXGCAAXAAAXCXXXXGGXXGCACXXACAAXGGCCCAXGAGC^ 

Fib51 XAGXGCAAIAAAXCXXXXGGXXGCACXXACAAXGGCCCATGAGCXGGGXC^^ 

950 960 970 980 990 1000 



1090 1100 1110 1120 1130 1140 

CAXGGAXCAXGAXGGAAAXCiUSXGXCAXXGCGGXGCXiUVCXCGXGCGXXAXGG^ 



**** 



Flb48 

Flb51 CAXGAAXCAXGAXGGAAAXCAGXGXCAXXGCSGXGCXAACXCGIGCGXCAXG^ 

1010 1020 1030 1040 1050 1060 

1150 1160 1170 1180 1190 1200 

Fib4 8 ACXAAGXAAXCAACCXXCCAAACXAXXCAGCGAXXGXAGXAAGAAAXACXAX^ 

* * * * * 

Fib51 GCXAAGXGAXCi^CCCXCCAAACXAXXCAGCGAXXGXAGXAAGAAAGACXAXCAGACGXX 
1070 1080 1090 1100 1110 1120 

1210 1220 1230 1240 1250 1260 

Fib4 8 XCXXAAGGXXAAAAACCCACAAXGCAXXCXCAAXAAACCCXXGAGAACAGAXACTGXXXC 

Flb51 XCXXACGGXXAAXAACCCAOU^XGOVXXCXCAAXAAACCCXXGAGAACAGAXACXGXXXC 
1130 1140 1150 1160 1170 1180 

1270 1280 ' 1290 1300 1310 1320 

Fib48 AACXCCAGXlXCXGGAAAXGAACXXXXGGAGGCGXGAGAAGAATC 

FibSl AACXCOUSXXXCTGGAAAXGAACXXXXGGAGGCGXGAGAAGAAXGX^ 

1190 1200 1210 1220 1230 1240 

1330 1340 1350 1360 1370 1380 

Fib48 XGCAGXCXGCMOUiaWSGCAGXGXGXXGAX^ 

FibSl XGCAGXCXGCAGCAACAGGCAGXGXGXXGAXGXGACXACAGCCIAAXAAXCA^ 

1250 1260 1270 1280 1290 1300 
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1390 1400 1410 1420 1430 1440 

Flb48 CXTCTCTCAGATTT6ATCTTGGA6ATCCTTCTTTCAGAAG6TTTGGCTTCCCTGTAS 

FibSl CTTCTCTCAGATTTGATCTTGGAGATCCTTCTirrCAGGAGGT^^ 

1310 1320 1330 1340 13S0 1360 . " 

1450 1460 1470 1480 1490 1500 

Fib4 8 AAAGAGACOyiTCTGCCTGCATCCTACTAGTAAATCaCTCTTAGCTT^ 

Fib51 AAAGAGACCCATCTGCCTGCATCCTACTAGTAAATCACTCTTAGCTTTCAT^ 

1370 1380 1390 1400 1410 1420 

1510 1520 1530 1540 1550 1560 

Fil54 8 AACTTCTGCAATATTTCTTCTCCATATTTAATCTGTTTACCTTTT<^ 

************** 

Fib5 1 AACTTCTGCAATATTTCTTCTCCATATTT AATC TGTAATCAAACCT 

1430 1440 1450 1460 • 

1570 1580 1590 1600 1610 1620 

Fib4 8 TTTCCeACCACAAAGCTCTATGGGCATGTACAACACCAACGGCTTATCTGCTG^ 

* 

FlbSl TTTCCCACCACAAAGCTCTATGTGCATGTACAACACCiUVCGGCT^ 

1470 1480 1490 1500 1510 1520 

1630 1640 1650 1660 1670 1680 

Fib4 8 AAAAAATGGCCAT!mACCGTTTGCCAAAGCACATT^ 

Fib51 AAAAAAIGGCCAXXXTACCGXTIGCCAAAGCAGATTTAATGCAAC^ 

1530 1540 1550 1560 1570 1580 

1690 1700 1710 1720 1730 1740 

Fib4 8 AGCTGGIGTATTCGAAGTGAATGTTTACTCTCXCAAAATTTCATGC^^ 

* 

FibSl AGCTGGTGTATTCGAAGTGAATGTTTACTGTCCCAAAATTTCATGCTGGCTTTC-^^ 
1590 1600 1610 1620 1630 1640 

1750 1760 1770 1780 

Fib4 8 TGraGCTGCTTCCGTCAATAAACTAACTATTCTCATTCAAAAAAAA 

* 

FibSl TGOy^GCTGCTTCCGTCAATAAACTAACTATTCTCATTAAAAAAAAAAAA 
1650 1660 1670 1680 1690 



94.3% ldentlt7 in 1708 bp overlap 
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10 20 30 40 50 60 

Flb48 MIQVlXVTICLTAFPYQGSSZILESGNVin:^ 

* . * * 

FibSl VTICXAAFPYQGSSIILESGirm^YEVVYPRKVTP^ 

10 !20 30 * 40 SO 

70 80 90 100 110 120 

Flb48 VtlGEPVVLHLEKNKGirSEDYSETHYSPOGREITTYPL^ 

FibSl VNGEPVVLHLEm;GLFSEDYSETHYSPIX3REITTYPLVEDHC^ 

60 70 80 90 100 110 

130 140 150 160 170 180 

Fib48 ACNGLKGHFKL(KEim-IEPLELSDSEAHAVYKmiVEK^ 

Flb51 kCUGI^GlXFKSJXEtmjlEPl^ 

120 130 140 150 160 170 

190 200 210 220 230 240 

Fib48 KAFQLNLTPEQQGFPQRYVELVIVADHRKniKYN 

FibSl KAFQI^TPEQQRFPQRYVQLVIVADHRMYM^ 

180 190 200 210 220 230 

250 260 270 280 290 300 

Flb48 QFVLVGLDIWSKXDLSTVTSVSHDTLASFENWRQ 

***** ***** * **** 

FibSl QFOliVGLEIWSNQDLITVTSVSHDIXi^FGNWRETD 

240 250 260 270 280 290 

310 320 330 340 350 360 

Fib4 8 i^PIJVGMCDPMFSVGIVEDHSAINLLVALTMAHEIUSpN^ 

*** ****** *** * ** 

FibSl lJiT^<SGM£QIJf3i^ 

300 310 320 330 340 350 

370 380 390 400 410 

Fib48 LSNQP.SKLFSDCSKKYYQKFIJCVKMPQCZLN^ 
* ***** 

FibSl LSIXIPSKLFSIXZSKKDYQTFLTVNNPQCILN^ 

360* 370 380 390 400 



87.9 % identity in 40S residue overlap 

FIG. 4 
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jATGTTTTGTTT TAG C|S G AAA GTC 

Blndlll StuZ 
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GACGTCTCTAAAGCCAGCT 



PstZ 



Sail 



ligate gene which encodes 231/aa 
to promoter and ■ to terminator 
uln pBR322 A RI Sal at BamRI site 



ADBj* GAPDB promoter FIBROLASE 
BajQBI Bindll l Bin dlll 



Sail 
1 



1341 bp 
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F248 derived from SVG 48 
r251 derived from SVG 51 
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Neu eingsreicht / Newiy filed 
. . N'ouvellement (iSpos6 



c 



c 



1 GATCCTTK:yuVTATCCGCA(»TA(XCIX3TTATGTTC^ 

CTAGGAAGTTATAGGGGTGTATGCGACAATACAAGTTC^GGGM 

i.3 FSPI MSTIr 

61 GCGGTCrrCCTTTTGAGGGATCTTT(»AGT 

CGCCAGAAGGAAAACTCCCTACAAAGrrcAACAAGTn 

121 AGTCTGTATCIAGAGCGTTGAATCGGTGATGCXSATTTGTTAAOT 

TCAGACATAGATCTCGCAACTTAGCCACTACGCTAAACAATtAAT^ 

129 XBAI, 

181 CATTACCAGGTCTAGATATACCAATGGCAAACnxSAGCACAACAATACCAGTC^ 
GTAATGGTCCAGATCEATATG6TTACCGTTTGACTCX3T6TTGTTA 

191 XBAI^ 

241 CTGGCACCATCTCTCCCGTAGTCTCATCTAATTTTTCTTCW^ 

GACa:TGGTAGift3AG6GCATCAGAGTAGATTAAAAAGAAGGCCTACTCCAAGGTCTAl:AT 

301 CCGCAACACCTTTATTATGGTTTCCCTGAGGGAATAATAGAATGTCC^ 

GGCGTTGTGGAAATAATACCAAAGGGACTCCCTTATTATCTTACAGGGTAAGCTTTAGTG 

325 MST2, 350 ASU2, 

361 CAATTCTAAACCTGGGCGAATTGTArrrCGGGTTTGTTAACTC^^ 
GTl»AGATTTGGACCa3CTTAACATAAAGCCCAAACAAT^ 

396 HFAI, 

421 TCCA(»TGAAGCrATCTTCCAGCAAAGTCTCCACTTCTT(^ 
AGGTGCACTTOSAXAGAAGGICGTl^ 

481 CCCA ATGCTCTTATCTATGGGACTTCCGGGAAACACAGTACCGAT^^ 

GGGTTACGAGAATAGATACCCnXJAAGGCCCTTrGTGTCATGGCTATGAATC^ 

541 T1X»GAGCTCATTGTTTGTTXGAAGAGACTAATCAAAGAATCGTTTTCTCAAAAAAATTA 
AAGTCTCGAl8TAI^AACAAAC1!!ECTCT6ATTAGTCT 

545 SACI/ 

601 ATATCTTAACTGATAfiraXGATWAAGGGGCAAAACGTAGGGGCAAACAAACGGAAAA^ 
TATAGAATaJGACTATCAAACTAGTT^ 

619 BCLI, 
661 CGTTTCTCAAAOTTTCEGAl?^ 



FIG. 6-1 
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Neu einc 



GreicM / wowiy \iiecJ \ 



KouveilernentdepcGO^ 



6CAAAi6AGTTTAAAA6ACaAC6GTIC3TO 

721 GATCCGTCTCTCCGGTTACAGCCTGTGTAACTCATTAA 

CTAGGCA6AGAG6CCAAIGT(3;6ACACJVTTGACTAATTAGGAC^ 

781 CTi^TCTTTTAATTAAGGGATTTTGTCrrCATTAAOT^ 

GATTACAAAATTAATTOCCXAAAACAGAAGTAATTGCCGAAAGCGACT^ 

84 1 GACGTTTTGCCCGCAaX!GGGAAACCATCCACTaX»CGAGACTGATCTC 
CTGCAAAACGGGCGTCCGCCajaTOGTAGGTGAAGTGCTCTGAC^^ 

901 CACCGGGCAlXnXX:AACTrATAAGTTGGAGAAATAAGAGAATTTCAGAT^ 
GTGGCCOjTAGAGGTTGAATATTCAACrTCTTTATTCTCTTAAAGTCT^ 

961 AAAAAAAAAACCCTTAGTTCATAGGTCCAO^CTCTTAGCGCAACTACAGAGAACAG^ 
TTTTTTTTTTGGGAATCAAGTATCCAGGTAAG^ 

1021 ACAAACAGGCAAAAAACGGGCACAACCTCAATGGAGTGATGC^ 
TGTTTGTCCGTTTTTTGCCCGTGTTGGAGTTACCTCACXACG 

1081 GATGACACAAGGCAATTGACCCACGCATGTATCTATCTCAXTTTCTTAOKCCTT^ 
CTACTGTGTTCGGTTAAClXyGGTGGGTACAXAGATAGAGTAAAAGAATGT^^ 

U4 1 CCTTCTGCTCTCTCTGATTTGGAAAAAGCTGAAAAAAAAGGTTGAAACC^ 
GGAAGAC6AGAGAGACTAAACCXTTTTCGACXTTTTTTTC 

1198 XMNI, 

1201 ATTATTCCCCTACTTCACTAATAAGTATATAAAGACGGTAGGTATTGATTGTAATTCTGT 
TAATAAGGGGATGAACTCATTATTCATATATTTCTGCCATCCATAACTAAC^ 

1261 AAATCTATTTCTTAAACTTCTTAAATTCTACTTTTATAGTTAGTCTTTT^^ 

TTXAGATAAAGAATTTGAAGAATTTAAGATGAAAATATCAATCAGAAAAAAAATCAA^ 

1317 AHA3, 

1321 AAACACCAAGAJVCTTAGTTTCGAATAAAC^CACATAAACAAACAAGCTT 
TTT6TGGTTCTTGAATCAAA6CTXATTTGIG«>TAX7XGTTTGTTCGAA 

1339 ASU2, 1364 HIND3, 



FIG. 6-2 
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